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C/3 . A novel integer sorting technique was proposed replacing bucket sort, distribu- 

te. ' 

tion counting sort and address calculation sort family of algorithms which requires 
only constant amount of additional memory. The technique was inspired from one 
Q , of the ordinal theories of "serial order in behavior" and explained by the analogy 

with the three main stages in the formation and retrieval of memory in cognitive 
neuroscience namely (i) practicing, (ii) storing and (iii) retrieval. 
■ In this study, the technique is improved both theoretically and practically and an 

oc ; 

\Q . algorithm is obtained which is faster than the former making it more competitive. 

^r) " With the improved version, n integers S[0 . . . n — 1] each in the range [0, n — 1] are 

ov 

o: 

(N 



sorted exactly in 0{n) time while the complexity of the former technique was the 
recursion T{n) = T(^) + Oin) yielding T(n) = 0{n). 



1 Introduction 



Nervous system is considered to be closely related and described with the "serial order 
in behavior" in cognitive neuroscience [HE] with three basic theories which cover almost 
all abstract data types used in computer science. These are [3] chaining theory, positional 
theory and ordinal theory. 

Chaining theory is the extension of stimulus-response (reflex chain) theory, where 
each response can become the stimulus for the next. From an information processing 
perspective, comparison based sorting algorithms that sort the lists by making a series 
of decisions relying on comparing keys can be classified under chaining theory. Each 
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comparison becomes the stimulus for the next. Hence, keys themselves are associated 
with each other. Some important examples are quick sort [3], shell sort [5], merge sort [B] 
and heap sort [7J. 

Positional theory assumes order is stored by associating each element with its position 
in the sequence. The order is retrieved by using each position to cue its associated element. 
This is the method by which conventional (Von Neumann) computers store and retrieve 
order, through routines accessing separate addresses in memory. Content-based sorting 
algorithms where decisions rely on the contents of the keys can be classified under this 
theory. Each key is associated with a position depending on its content. Some important 
examples are distribution counting sort [SI IS], address calculation sort [TUHTo] . bucket 
sort [THE] an d radix sort pT6lTl9] . 

Ordinal theory assumes order is stored along a single dimension, where that order is 
defined by relative rather than absolute values on that dimension. Order can be retrieved 
by moving along the dimension in one or the other direction. This theory need not assume 
either the item-item nor position-item associations of the previous theories. 

One of the ordinal theories of serial order in behavior is that of Shiffrin and Cook 
[2TJj which suggests a model for short-term forgetting of item and order information of 
the brain. It assumes associations between elements and a "node", but only the nodes 
are associated with one another. By moving inwards from nodes representing the start 
and end of the sequence, the associations between nodes allow the order of items to be 
reconstructed [3]. 

The main difficulties of all distributive sorting algorithms is that, when the keys are 
distributed using a hash function according to their content, several of them may be 
clustered around a loci, and several may be mapped to the same location. These problems 
are solved by inherent three basic steps of associative sort [21] (i) practicing, (ii) storing 
and (iii) retrieval, which are the three main stages in the formation and retrieval of 
memory in cognitive neuroscience. 

1.1 Original Technique 

As in the ordinal model of Shiffrin and Cook, it is assumed that associations are between 
the integers in the list space and the nodes in an imaginary linear subspace that spans a 
predefined range of integers. The imaginary subspace can be defined anywhere on the list 
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space S[0 . . .n — 1] provided that its boundaries do not cross over that of the list. The 
range of the integers spanned by the imaginary subspace is upper bounded by the number 
of integers n but may be smaller and can be located anywhere making the technique in- 
place, i.e., beside the input list, only a constant amount of memory locations are used for 
storing counters and indices. Furthermore, this definition reveals the asymptotic power 
of the technique with increasing n with respect to the range of integers, as well. 

An association between an integer and the imaginary subspace is created by a node 
using a monotone bijective hash function that maps the integers in the predefined interval 
to the imaginary subspace. The process of creating a node by mapping a distinct integer to 
the imaginary subspace is "practicing a distinct integer of an interval". Since imaginary 
subspace is defined on the list space, this is just swapping. Once a node is created, 
the redundancy due to the association between the integer and the position of the node 
releases the word allocated to the integer in the physical memory except for one bit which 
tags the word as a node for interrogation purposes. All the bits of the node except 
the tag bit can be cleared and used to encode any information. Hence, they are the 
"record" of the node and the information encoded into a record is the "cue" by which 
cognitive neuro-scientists describe the way that the brain recalls the successive items 
in an order during retrieval. For instance, it will be foreknown from the tag bit that 
a node has already been created while another occurrence of that particular integer is 
being practiced providing the opportunity to count other occurrences. The process of 
counting other occurrences of a particular integer is "practicing all the integers of an 
interval", i.e., rehearsing used by cognitive neuro-scientists to describe the way that the 
brain manipulates the sequence before storing in a short (or long) term memory. On the 
other hand, the tag bit discriminates the word as node and the position of the node lets 
the integer be retrieved back from the imaginary subspace using the inverse hash function. 

Practicing does not need to alter the value of other occurrences, i.e., only the first 
occurrence is altered while being practiced from where a node is created. All other 
occurrences of that particular integer remain in the list space but become meaningless. 
Hence they are "idle integers" . On the other hand, practicing does not need to alter the 
position of idle integers as well, unless another distinct integer creates a node exactly 
at the position of an idle integer while being practiced. In such a case, the idle integer 
is moved to the former position of the integer that creates the new node. This makes 
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associative sort unstable, i.e., equal integers may not retain their original relative order. 
However, an imaginary subspace can create other subspaces and associations using the 
idle integers that were already practiced by manipulating either their position or value or 
both. Hence, a part of linear algebra and related fields of mathematics can be applied on 
subspaces to solve such problems. 

Once all the integers in the predefined interval are practiced, the nodes that are dis- 
persed in the imaginary subspace are clustered in a systematic way, i.e., the distance 
between the nodes are closed to a direction retaining their relative order. This is the 
storing phase of associative sort where the received, processed and combined information 
to construct the sorted permutation of the practiced interval is stored in the short-term 
memory (for instance, beginning of the list). When the nodes are moved towards a direc- 
tion, it is not possible to retain the association between the imaginary subspace and list 
space. However, the record of a node can be further used to encode the absolute position 
of that node as well, or maybe the relative position or how much that node is moved 
relative to its absolute or relative position during storing. Unfortunately, this requires 
that a record is enough to store both the position of the node and the number of idle 
integers practiced by that node. However, as explained earlier, further associations can 
be created using the idle integers that were already practiced by manipulating either their 
position or value or both. Hence, if the record is enough, it can store both the positional 
information and the number of idle integers. If not, an idle integer can be associated 
accompanying the node to supply additional space for it for the positional information. 

Finally, the sorted permutation of the practiced interval is constructed in the list 
space, using the stored information in the short-term memory. This is the retrieval phase 
of associative sort that depends on the information encoded into the record of a node. 
If the record is enough, it stores both the position of the node and the number of idle 
integers. If not, an associated idle integer accompanying the node stores the position of 
the node while the record holds the number of idle integers. The positional information 
cues the recall of the integer using the inverse hash function. This is "integer retrieval" 
from imaginary subpace. Hence, the retrieved integer can be copied on the list space as 
much as it occurrs. 

Hence, moving through nodes that represent the start and end of practiced integers 
as well as retaining their relative associations with each other even when their positions 
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are altered by cuing allow the order of integers to be constructed in linear time in-place. 

The adjective "associative" derived from two facts where the first one mentioned above 
describes the technique. The second one is that, although associative sort replaces all 
derivatives of the content based sorting algorithms such as distribution counting sort 
address calculation sort pTJHT5] and bucket sort [T6l[T7] on a RAM, it seems to be more 
efficient on a "content addressable memory" (CAM) known as "associative memory" which 
in one word time find a matching segment in tag portion of the word and reaches the 
remainder of the word [22] • For associative sort developed on a RAM, the nodes of the 
imaginary subspace (tagged words) and the integers of the list space (untagged words) 
are processed sequentially which will be a matter of one word time for a CAM to retrieve 
previous or next tagged or untagged word. 

The technique seems to be efficient and applicable for other problems, as well, such 
as hashing, searching, element distinction, succinct data structures, gaining space, etc. 
For instance, there are several space gaining techniques available and widely used in the 
literature for in-place and minimum space algorithms [231426] . However, as known to 
the author, all these in-place and minimum space algorithms have a dedicated explicit 
technique that is used only for space gaining purpose. On the contrary, gaining space 
is an inherent step of associative sort which improves its performance and can be used 
explicitly. 

From complexity point of view, associative sort shows similar characteristics with 
bucket sort and distribution counting sort. Hence, it can be thought of as in-place asso- 
ciative bucket sort or in-place associative distribution counting sort. Distribution counting 
sort is seldom discussed in the literature although it has been around more than 50 years 
since proposed by Seward [5] in 1954 and by Feurzig [9] in 1960, independently, and known 
to be the method that makes radix sort possible on digital computer. It is known to be 
very powerful when the integers have small range. Given n integers S[0 . . . n — 1] each in 
the range [0, m — 1], its time-complexity is 0(n + m) and requires n + m additional space 
for a stable and m for an unstable sort. Hence, distribution counting sort becomes effi- 
cient and practical when m = 0(n) defining its time-space trade-offs. On the other hand, 
bucket sort is a generalization of distribution counting sort. In fact, if each bucket has 
size 1, then bucket sort degenerates to distribution counting sort. However, the variable 
bucket size allows it to use 0{n) memory instead of 0{m + n) memory. Its average case 
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time complexity is 0(n + m) and if m = 0(n), then it becomes 0(n). Its worst case time 
complexity is 0(n 2 ). 

1.2 Improved Technique 

In this study, with a simple revision, the associative sorting technique is improved both 
theoretically and practically and a faster technique is achieved. During storing where 
the nodes are clustered at the beginning of the list retaining their relative order, the 
positional information (logn bits) of a node is encoded into either its record or an idle- 
integer accompanying the node. However, the tag bit discriminates the word as a node 
in the list space and if ignored during storing it will continue to discriminate the word 
as a node. This means that, if only the records (w — 1 bits) of the nodes are clustered 
at the beginning of the list (short-term memory) retaining their relative order, there 
will be rid nodes dispersed in the list space, and rid records in the short-term memory 
(S[0, . . .ri d — 1]) after storing. Hence, a one-to-one correspondence is obtained with the 
clustered records and the nodes (tagged words) of the list. Therefore, retrieval phase can 
search the list from right to left for the first tagged word, retrieve the integer from the 
imaginary subspace through that node, read its number of occurrence from its record 
S[rid — 1] in the short-term memory and expand it over the list starting at S[rid + n c — 1] 
where n c is the number of idle integers. Afterwards, the processed tag bit can be cleared 
and a new search to the left can be carried for the next tagged word which will correspond 
to the next record S[rid — 2] of the short-term memory. This can continue until all the 
integers are retrieved from short-term memory resulting in the sorted permutation of the 
practiced integers. 

While the former technique was capable of sorting integers that satisfy S[i] — <5 + e < n 
where 5 = min(S) and e G [0, |] is defined by Eqn.4.2 [5TJ, the improved version sorts 
the integers that satisfy S[i] — 5 < n. Hence, n integers S[0, . . .n — 1] each in the range 
[0, n — 1] will be sorted exactly in 0(n) time while the complexity of the former technique 
was the recursion T(n) = T(|) + 0(n) yielding T(n) = 0(n). 

With this introductory information, the contribution of this study is, 

A practical sorting algorithm that sorts n integers S[0 . . .n — 1] each in the range 
[0, m— 1] using 0(1) extra space in 0(n+m) time for the worst, 0(m) time for the average 
(uniformly distributed integers) and 0(n) time for the best case. The ratio — defines the 
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efficiency (time-space trade-offs) of the algorithm letting very larges lists to be sorted in- 
place. The algorithm is simple and practical replacing bucket sort, distribution counting 
sort and address calculation sort family of algorithms improving the space requirement 
to only 0(1) extra words. 

Practical comparisons for 1 million 32 bit integers with quick sort showed that associative 
sort is roughly 3 times faster for uniformly distributed integers when m = n. When 
— = 10 performances are same. When ^ = associative sort becomes roughly 4 times 
faster than quick sort. If the distribution is exponential, associative sort shows better 
performance up to — ~ 25 when compared with quick sort. 

Practical comparisons for 1 million 32 bit integers showed that radix sort is 2 times faster 
for uniformly distributed integers when m = n. However, associative sort is 2 times faster 
than radix sort when — = Further decreasing the ratio to — = ™, associative sort 

n 10 n 100 ' 

becomes more than 3 times faster than radix sort. 

Practical comparisons for 1 million 32 bit integers showed that bucket sort with each 
bucket of size one (hence distribution counting sort) is 2 times faster than associative sort 
for ^ = 1. Bucket sort is still slightly better but the performances get closer when ~ < ^ 
and 2* > 10. 

n 

Even omitting its space efficiency for a moment, associative sort asymptotically outper- 
forms all content based sorting algorithms when n is large relative to m. 

2 Definitions 

Given a list S of n integers, S[0], S[l], . . . , S[n — 1], the problem is to sort the integers in 
ascending or descending order. The notations used throughout the study are: 

(i) Universe of integers is assumed U = [0 ... 2 W — 1] where w is the fixed word length. 

(ii) Maximum and minimum integers of a list are, max(S') = max(a|a G S) and 
min(S') = min(a|a G S), respectively. Hence, range of the integers is, m = 
max(S') — min(S') + 1. 

(iii) The notation B C A is used to indicated that B is a proper subset of A. 

(iv) For two lists S\ and S2, max(S'i) < min(S < 2) implies S\ < S2. 

Universe of integers. When an integer is first practiced, a node is created releasing w 
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bits of the integer free. One bit is used to tag the word as a node. Hence, it is reasonable 
to doubt that the tag bit limits the universe of integers because all the integers should be 
untagged and in the range [0, 2 W ~ 1 — 1] before being practiced. But, we can, 

(i) partition S into 2 disjoint sublists Si < 2 w ~ l < S2 in 0(n) time with well known 
in-place partitioning algorithms as well as stably with [25] . 

(ii) shift all the integers of S 2 by —2 W ~ 1 , sort Si and S 2 associatively and shift S 2 by 

2 w-l_ 

There are other methods to overcome this problem. For instance, 

(i) sort the sublist 5[0 ... (n/ logn) — 1] using the optimal in-place merge sort [27], 

(ii) compress S[0 . . . inj logn) — 1] by Lemma 1 of [23] generating Q(n) free bits, 
(hi) sort S[(n/ logn) . . .n — 1] associatively using Vt{n) free bits as tag bits, 

(iv) uncompress S[0 . . . (n/ log n) — 1] and merge the two sorted sublists in-place in linear 
time by [27]. 

Number of integers. If practicing a distinct integer lets us to use w — 1 bits to practice 
other occurrences of that integer, we have w — 1 free bits by which we can count up to 
2™" 1 occurrences including the first integer that created the node. Hence, it is reasonable 
to doubt again that there is another restriction on the size of the lists, i.e., n < 2 w ~ l . But 
a list can be divided into two parts in 0(1) time and those parts can be merged in-place 
in linear time by [27] after sorted associatively. 

Hence, for the sake of simplicity, it will be assumed that n < 2 W ~ 1 and all the integers 
are in the range [0, 2 1 " -1 — 1] throughout the study. 

3 Sorting n Integers 

In this section, the improved associative sorting technique based on the three basic steps 
namely (i) practicing, (ii) storing and (iii) retrieval will be introduced. 

Lemma 3.1. Given n <= 2 W ~ 1 integers S[0...n — 1] each in the range [O^'" -1 — 1], all 
the integers in the range [8, 8 + n — 1] where 5 = min(S') can be sorted associatively at the 
beginning of the list in 0(n) time using 0(1) constant space. 

Proof. Given n <= 2 w ~ l distinct integers S[0...n — 1] each in the range [0, 2 w ~ l — 1], it 
is not possible to construct a monotone bijective hash function that maps all the integers 
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of the list into j G [0,n — 1] without additional storage space. At least Q(n + log to) bits 
are required for a minimal perfect hash function [2H] which is not bijective and monotone. 
However, a monotone bijective hash function can be constructed as a partial function [22] 
that assigns each integer of Si C S in the range [8, 8 + n — 1] with 5 = min(S) to exactly 
one element in j e [0, n — 1]. The partial monotone bijective hash function of this form 
is. 

j = S[i]-6 if S[i]-6<n (3.1) 
With this definition, the proof has three basic steps of associative sort: 

(i) Practice all the integers of the interval [8, 8+n — 1] into lm[0 . . . n — l] over S[0 . . . n — 
!]• 

(ii) Store only the records (w — 1 bits) of the nodes at the beginning of the list (short- 
term memory) retaining their relative order. Hence, a one-to-one correspondence is 
obtained with the stored records and the nodes (tagged words) of the list. 

(iii) Retrieve the sorted permutation of the practiced interval by searching the tagged 
words of the list backwards to retrieve the integers from the imaginary subspace. 
When an integer is retrieved from the imaginary subspace, read the number of 
occurrence of that integer from the corresponding record of the short-term memory 
and expand over the list backwards as many as it occurs. 

□ 

3.1 Practicing Phase 

Details of practicing phase can be found in [2TJ . 

Algorithm A. Practice all the integers of the interval [8, 8 + n — 1] into the imaginary 
subspace im[0 . . .n — l] over S[0 . . .n — l] using Eqn. 13.11 It is assumed that the minimum 
of the list 8 = min(S') is known. 

Al. initialize % — 0; 

A2. if MSB of S[i] is 1, then S[i] is a node. Hence, increase % and repeat this step; 
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A3, if S[i] —8 > n, then S[i] is an integer of S2 that is out of the practiced interval. Hence, 
increase n' d that counts the number of integers of S 2 , update 5' = min(S' , S[i}), 
increase % and goto to step IA2j 

A4. otherwise, S[i] is an integer to be practiced. Hence, calculate j = S[i] —5 (Eqn. l3TTj) : 

A5. if MSB of S[j] is 0, then S[i] is the first integer that will create the node at j. Move 
S[j] to S[i], clear S[j] and set its MSB to 1 making it a node. If j < i increase i. 
Increase rid that counts the number of distinct integers (nodes), and goto step IA2( 

A6. otherwise, S[j] is a node that has already been created. Hence, clear MSB of S[j], 
increase S[j] (number of idle integers) and set its MSB back to 1. Increase % and n c 
that counts the number of total idle integers over all distinct integers and goto step 

El 

3.2 Storing Phase 

Practicing creates rid nodes and n c idle integers. This means rid integers of Si are mapped 
into the imaginary subspace creating nodes that are dispersed with relative order in 
lm[0 ... n — 1] over S[0 ... n — 1] depending on the statistical distribution of the integers. 
On the other hand, n c idle integers of Si are distributed disorderly together with n' d 
integers of S 2 in the list space. 

In storing phase, the records are clustered in a systematic way, i.e., the distance 
between the records of the imaginary subspace are closed to a direction (beginning of 
the list) without altering their relative order with respect to each other. As long as the 
position of the tag bits are not altered, the association between the imaginary subspace 
and the list space is retained. 

Algorithm B. Store the records of the practiced interval in the short term memory. 
Bl. initialize i = 0, j — 0, k — rid, 

B2. if MSB of S[i] is 0, then S[i] is either an idle integer or an integer of S2 that is out 
of the practiced interval. Hence, increase i and repeat this step; 

B3. otherwise, S[i] is a node. Hence, swap least significant w — 1 bits of S[i] with least 
significant w — 1 bits of S[j]. Increase i and j and decrease k. If k = exit, otherwise 
goto step IB2t 
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3.3 Retrieval Phase 

Storing clusters rid records of the nodes at S[0 . . . n d — 1]. Hence, S[0 ... — 1] can be 
though of as a short-term memory where the encoded information of the rid+n c integers of 
the practiced interval is stored. The stored information is the number of occurrences of the 
distinct integers. However, the tagged words of the list have a one-to-one correspondence 
with these rid records from left to right or vice versa. 

In retrieval phase, the stored information is retrieved from the short term memory 
S[0 . . .rid ~ 1] to construct the sorted permutation of the practiced interval. The short 
term memory encodes rid + n c integers of Si with rid permanent records. Each record 
stores the number of occurrence of a particular integer. On the other hand, the particular 
integer can be retrieved back to list space through its corresponding node. It is important 
to note that, if the number of occurrences of a particular integer is n.j, then there are 
n i — 1 idle integers in the list. But the record itself represents the integer that is mapped 
into the imaginary subspace. Hence, it is immediate from this definition that the list can 
be searched from right to left backwards for the first tagged word to retrieve the integer 
back to list space (using the inverse hash function) and the integer can be copied as many 
as it occurs which can be read from the record S[rid — 1] of the node in the short term 
memory S[0 . . . n^ — 1]. Afterwards, the processed tag bit can be cleared and a new search 
to the right can be carried for the next tagged word which will correspond to the next 
record S[rid — 2] of the short-term memory. This can continue until all the integers are 
retrieved resulting in the sorted permutation of the practiced integers. 

It should be noted that, n c idle integers of Si and n' d integers of S 2 are distributed 
disorderly together at S[n d . . . n — 1]. Hence, before proceeding, n c idle integers should be 
clustered at the beginning of S[n d . . .n — 1]. Afterwards, the retrieval phase can begin. 
This is a simple partitioning problem where the tag bits should be taken under care. 

Algorithm C. Partition S[n<f ... n — 1] to cluster n c idle integers to the beginning. 

CI. initialize % = rid, j — n d , k = n c ; 

C2. Read w — 1 bits of S[i] into s. If s — 5 >= n, then S[i] is an integer of S2 that is out 
of the practiced interval. Hence, increase % and repeat this step; 

C3. otherwise, S[i] is an idle integer. Hence, swap least significant w — 1 bits of S[i] with 
least significant w — 1 bits of S[j]. Increase i and j and decrease k. If k = exit, 
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otherwise goto step IC2( 



Algorithm D. Process the list from right to left to find a node, retrieve the integer from 
the imaginary subspace through that node, read the number of occurrence of that integer 
from the corresponding record S[n d — 1] and expand the integer over S[0 . . .n d + n c — 1] 
sequentially right to left backwards. Then find the next node which corresponds to the 
record at S[n d — 2]. Continue until all the integers are retrieved and expanded over 
S[0 . . .n d + n c - 1]. 

Dl. initialize % = n — 1, j = n d — 1 and k = n d + n c , 

D2. if MSB of S[i] is 0, then it is not a node. Hence, decrease i and repeat this step; 
D3. otherwise, S[i] is a node. Hence, the position % of the node cues the recall of the 



integer using the inverse of Eqn. 13.11 Get the number of occurrence of the retrieved 
integer from its record (w — 1 bits of S[j}) and copy the integer as many as it occurs 
starting with w — 1 bits of S[k — 1] and decreasing k one for each copied integer. At 
the end, clear MSB of S[i], decrease i, j and k. If k = 0, then exit. Otherwise goto 
step lD2l 

Sequential Version After retrieval phase, n' d integers of S2 are clustered disorderly at 
S[n d +n c . . . n— 1]. Hence, the same algorithm can be called to sort this new list S[n' d . . . n— 
1] with its minimum 5' found at step IA3I 

Remark 3.1. Improved associative sort technique is on-line in the sense that after each 



retrieval phase ( [Algorithm D[ ), n d + n c integers are added to the sorted permutation at 
the beginning of the list and ready to be used. 

Remark 3.2. Recursive version is not possible as long as the tag bits are not clustered 
during storing phase. 

Complexity of the algorithm depends on the range and the number of integers. In 
each iteration the algorithm is capable of sorting the integers that satisfy S[i] — 5 < n. 
Hence, given uniformly distributed n integers S[0 . . .n — 1] each in the range [0, n — 1], 
the complexity is 0(n). 

Best Case Complexity. Given n integers S[0 . . . n — 1], if n — 1 of them satisfy S[i] — 5 < 
n, then these are sorted in 0(n) time. In the next step, there is one integer left which 
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implies sorting is finished. As a result, time complexity of the algorithm is lower bounded 
by fl(n) in the best case. 

Worst Case Complexity. Given n integers S[0 . . .n — 1] and m = (3n, if there is only 
1 integer available in practiced interval at each iteration until the last, in any jth step, 
the only integer s that will be sorted satisfies s < jn — (j — 1) which implies that the last 
alone integer satisfies s < jn — (j — 1) < (5n from where we can calculate j by j < ^5p 
In this case, the time complexity of the algorithm is, 

0(n) + 0(n - 1) + . . . + 0(n - j) = (j + \)0{n) - O(f) < (0 + l)0(n) (3.2) 

Therefore, the algorithm is upper bonded by ((3 + l)0{n) = 0{m + n) in worst case. 

Average Case Complexity. Given n integers S[0 . . . n— 1], if m — fin and the integers 
are uniformly distributed, this means that j| integers satisfy S[i] < n. Therefore, the 
algorithm is capable of sorting ^ integers in 0(n) time during first pass. This will continue 
until all the integers are sorted. The sum of sorted integers in each iteration can be 
represented with the series, 

n ntf-l) n((3 - l)^ 1 

+ p + ' ' ' + W k [ } 

It is reasonable to think that the sorting ends when one term is left which means the sum 
of k terms of this series is equal to n — 1, from where we can calculate the number of 
iteration or dept of recursion k which is valid when > 1 by, 

n p k 

It is seen from Eqn. 13.41 that when m = 2n, i.e., = 2, number of iteration or dept of 
recursion becomes k = logn and the complexity is the recursion T(n) = T(S) 4- 0(n) 
yielding T(n) = 0(n). It is known that each step takes 0(n) time. Therefore, the time 
complexity of the algorithm is, 

0W+0 (^) + ... + (^J^) (3,) 

from where we can obtain by defining x = , 

0(n) (1 + x + x 2 + x 3 + • • • + x fc - x ) = 0(n)(— < (30(n) (3.6) 

1 — x 1 — x 

which means that the algorithm is upper bounded by (30(n) or 0(m) in the average case. 
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4 Conclusions 



In this study, in-place associative integer sorting technique is improved. The improved 
version can only run sequentially. Recursive version is not possible. While the original 
technique was capable of sorting integers that satisfy S[i] — 5 + e < n where 5 = min(S') 
and e G [0, §], the improved version sorts the integers that satisfy S[i] — 5 < n. Hence, 
given n integers S[0, . . . n — 1] each in the range [0, n — 1] will be sorted exactly in 0(n) 
time while the complexity of the former technique was the recursion T(n) = T(~ ) + 0(n) 
yielding T(n) = 0(n). 

Using the technique, the main difficulties of distributive sorting algorithms are solved 
by its inherent three basic steps: (i) practicing, (ii) storing and (iii) retrieval which are 
three main stages in the formation and retrieval of memory in cognitive neuroscience. The 
technique is very simple and straightforward and around 30 lines of C code is enough. 

The technique sorts the integers using only 0(1) extra space in 0(n + m) time for 
the worst, 0(m) time for the average (uniformly distributed integers) and 0(n) time for 
the best case. It shows similar characteristics with bucket sort and distribution counting 
sort. However, it is time-space efficient than both. The ratio — defines the efficiency 
(time-space trade-offs) letting very large lists to be sorted in-place. Furthermore, the 
dependency of the efficiency on the distribution of the integers is 0(n) which means it 
replaces all the methods based on address calculation, that are known to be very efficient 
when the integers have known (usually uniform) distribution and require additional space 
more or less proportional to n. Hence, associative sort asymptotically outperforms all 
content based sorting algorithms. 

The only drawback of the algorithm is that it is unstable. But, an imaginary subspace 
can create other subspaces and associations using the idle integers that were already prac- 
ticed by manipulating either their position or value or both. Hence, different techniques 
can be developed on subspaces to solve other problems such as stability. 
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